Background

A long time ago, in a galaxy far, far away…

For many people, Star Wars has been one of the greatest movie series of all time. And even if you haven’t watched it, you certainly know some of the music it contains. Over the years, the Star Wars franchise expanded into various movies and television series, such as the Mandalorian, Clone Wars and Star Wars Rebels. In this portfolio, a distinction is made between the music of three major Star Wars series. These series are The Star Wars movies, composed by John Williams, the Clone Wars series, composed by Kevin Kiner, and the Mandalorian series, composed by Ludwig Goransson. The corpus exists of three times forty songs, forty songs for each of these three composers. I think it would be interesting to look into the differences between their music and to see if it is possible to identify the composer based on the song features.

Fig 1: John Williams, Kevin Kiner and Ludwig Goransson

Fig 1: John Williams, Kevin Kiner and Ludwig Goransson

I expect that the difference between Goransson and Williams is the most noticeable, because Star Wars is a space opera while the Mandalorian defines itself as a space Western. The differences between Kiner and Williams and Kiner and Goransson are more difficult to find, and thus far no one has specifically looked into it. While I was listening to the music myself I found that Kiner’s music felt more powerful and energetic than the music of the other two composers. The corpus tries to cover every movie or episode, but since a movie contains more music than a single episode it was hard to select the right songs. Therefore, I used 5 songs from every Star Wars movie, except for one. This was to maintain an equally divided song selection between the composers. From the Clone Wars series, 10 songs from every season were selected and for the Mandalorian I selected 5 songs per episode of season 1. The songs were randomly selected to avoid a certain bias. Below you can find an overview of the selection.

John Williams Kevin Kiner Ludwig Goransson
5x The Rise of Sky Walker 10x Clone Wars I 5x The Mandalorian Ch1
5x The Last Jedi 10x Clone Wars II 5x The Mandalorian Ch2
5x The Force Awakens 10x Clone Wars III 5x The Mandalorian Ch3
5x The Phantom Menace 10x Clone Wars IV 5x The Mandalorian Ch4
5x Revenge of the Sith 5x The Mandalorian Ch5
5x A New Hope 5x The Mandalorian Ch6
5x Attack of the Clones 5x The Mandalorian Ch7
5x The Empire Strikes Back 5x The Mandalorian Ch8

A typical Star Wars movie composition is Duel of the Fates. Duel of the Fates emphasises the lower register of the strings with a repeated moving line, bright bursts of brass and the choir provides this piece with its glory and magnificence.The loudness and energy of the song is a recurring theme in the Star Wars movies. For the Mandalorian, “The Mandalorian” is a very typical song because it is one of the most similar songs as compared to the Star Wars movies. An outlier is Cafe Coruscant from Kiner, which sounds more like a normal radio song than a Star Wars song. In this portfolio we will look into a few more outliers.

The main limitation of this analysis and corpus is that there is little to compare on the individual song level. The composers all have totally different songs, they did not use a theme from another composer or something like that. Therefore looking at the individual song level is not really reliable, it is similar as comparing apples with oranges. Also not all songs are covered in this corpus, meaning that I could accidentally left out important outliers or typical songs.

You can find the exact corpus here.

Track-level features

John Williams is a huge fan of acousticness, but not of energy


In the visualization you can see the acousticness and instrumentalness plotted against each other. The color of the dots matches the energy, with the lighter colors having a higher energy. The size of the dots matches the loudness of the track. The bigger the dot, the louder the track is. As you can see in the visualization, the music of John Williams stands out as compared to the other two composers, in two ways. First, Williams uses a lot of instrumental and acoustic music, while the music of Kiner and Goransson varies more in acousticness. All of the three composers have a high instrumentalness score, but the acousticness score varies. Second, William’s music has a lower energy score than Kiner’s and Goransson’s. William’s tracks are darker, indicating lower energy use.

Sing or sing not, there is no speech


In this graph you find the speechiness of each track sorted in ascending order. It is commonly known that Star Wars song do not contain any singing, so I was surprised when I found even a few tracks that did contain any speech according to Spotify. Unfortunately, I have to disappoint you, the Star Wars songs do not contain speech. The Spotify API wrongly thinks that Speederbikes and Night Riders contain speech. This does however have an influence on my research question whether we can distinguish our composers. In the case that a song is labeled with a speechiness of 0.5 or higher, there is a 19/25 chance that it is a song of Ludwig Goransson. This is a chance of 76%, which is very high considering we are just looking at one feature.

Pitch and Timbre

Taking a look into John Williams’ use of pitch and timbre: a mess


Here you find the cepstrograms of Finale composed by John Williams. Of course this is not a total accurate representation of his structure style, meaning that the comparison made between the cepstrograms of the different composers is not entirely reliable. The songs in the corpus of each composers vary a lot, so I tried to illustrate an average song for each composer. That being said, in the cepstrogram of the chroma features it is hard to find a structure. This is very common for Williams’ music since he shifts pitches all the time. Finale is a general song in the Star Wars movies, and you can hear its theme multiple times in the movies. Regarding the timbre features you can see that c02 and c03 are mostly used. However, these timbre features are used during the whole song, and it doesn’t switch to other timbre features very often. In the cepstrograms you can see two white straight lines, these lines show the Star Wars theme that occurs in the Finale song. In the chroma cepstrogram it is not noticeable that this theme occurs, but you can clearly see in the timbre cepstrogram that c01 wasn’t used before but started to when this theme joined in. After this theme you see a small decline in the usage of c01.

Taking a look into Kevin Kiner’s use of pitch and timbre: a clear structure


Here you find the cepstrograms of Prison Cell Talk by Kevin Kiner. In contrast to Williams’ Finale, you clearly see a structure in the chroma cepstrorgram. The song is divided into 5 parts, first the intro which slowly shifts into the V1. After a short time V1 shifts to V2, which is the main part of the song with little variation. Then the song starts to fade in V3, which ends in the Outro. After around 50 seconds, it is harder to distinguish the parts. So V2, V3 and the Outro shift very smoothly into each other which is almost unrecognizable. Since these shifts are very closely related to each other in terms of pitch it is harder to hear the difference.
If you apply the same structure to the timbre cepstrogram you can’t find the same structure. There is a small outlier in V3 at around 140 seconds, this is the part where the main instrument stops for around 10 seconds and then continues into another variation.

Taking a look into Ludwig Goransson’s use of pitch and timbre: a medium structure


Here you find the cepstrograms of The Mandalorian by Ludwig Goransson. The chroma cepstrogram aligns well with the song. Throughout the song there is a varying pitch which matches the rhythm, so all those vertical lines after every second is really what is going on in this song. The intro at the beginning has a different pitch than the main part, which the cepstrogram clearly shows. Then comes the main part which varies in pitch a lot, but the main pitch C is noticeable in the cepstrogram. At around 180 seconds the outro starts in six different pitches. Regarding the timbre cepstrogram, you can see that throughout the song there is a main timbre feature c02. During the intro there is a little bit c05 and c06 noticeable, but the main timbre feature stays c02. In the outro the timbre switches from c02 to c03.

Key and Chords

Kiner and Goransson forgot their A key, Williams once again stands out.


One of the Spotify features is key, where every song is assigned the main key of the song. When talking about this graph, only the main keys of the songs are discussed. In this graph you can see how many times a composer used a certain key for their song. As you can see, Williams mostly uses A, D, F and G. Meanwhile Kiner mostly uses A#, C, D and F. Lastly Goransson mostly uses C, F and G. Key F is popular for all three of the composers, but the A key is only used by Williams. Williams varies more in the usage of keys in general.

Focussing on Kiner and Goransson: Keygrams


Here you find a keygram of Ludwig Goransson’s The Baby and a keygram of Chase in the Valley by Kevin Kiner. The darker the color, the more a certain key is used. The graph compares the averaged chroma vectors against templates to yield a keygram. I chose the composition of The Baby and Chase in the Valley because they are very typical for their composers. The keys of these songs are known and that is why I could see if the Spotify API correctly got the keys right.

In the first graph you see that Kiner start with G#-minor, then through D-minor, A-minor and F-minor back to A-minor. When I checked this results, the Spotify API turned to be wrong, Chase in the Valley is written in B-flat major. When you look at the keygram, this is barely noticeable. In the middle of the composition you see a huge decrease in key usage, that is the part where the violins come in. At the end of he song the music turns really loud and has multiple instruments, this would explain why the keys are nearly the same as the neighbors at the end. In the second graph, The Baby, you see the shift from C-major through G-major to C-minor, then from C-minor to D-minor. In the middle of the composition you see a huge increase in key usage. While listening to the song you find that in this part the pitch goes up and the music becomes more intense. After this the music slowly shifts towards D-minor. This time the Spotify API detected the keys correctly.

Temporal features

Global overview of tempo usage


Here you find a histogram of the tempo usage per composer. As you can see, a tempo between around 75 and 85 are most used by all three the composers. Williams tempo is overall slower than that of Kiner and Goransson. The average tempo of Williams is around 100 BPM, while the average tempo of Kiner and Goransson is around 109 BPM. To illustrate some outliers, I added The Magic Tree by John Williams and Obi-Wan and Ahsoka Argue by Kevin Kiner. Since I listened to both composers a lot in the last few weeks, the tempo difference for each of them felt weird. Williams’ music is always calm and soothing, even in the battle parts, but The Magic Tree is faster and makes you more alert. In contrast Kiner’s music always gives me a feeling that something terrible is about to happen, but not in Obi-Wan and Ahsoka Argue. Another fun fact, the average tempo of all three composers lies below the preferred tempo as we have read in Moelants (2002).

Kiner’s outlier saves the day


Here you find two cyclic tempograms of tracks by Kevin Kiner, Cafe Coruscant and Obi-Wan and Ahsoka Argue. The Window size is set to 2, and the hop-size is set to 1, otherwise my computer crashed. I chose to focus on Kiner because all three composers use a lot of variance in tempo in general, but Kiner has a few songs with a main tempo. One of these songs is Cafe Coruscant, which can be seen as an outlier, while Obi-Wan and Ahsoka Argue is more a typical song. The tempograms of Kiner overall are a mess because he varies the tempo a lot, an outlier such as Cafe Coruscant is therefore easy to recognize. The song has a steady tempo with little variance and that is why you get this tempogram, which is a little bit better. The overall tempo of Cafe Coruscant is between 140 and 150 BPM. This matches with the Spotify track-level tempo of Cafe Coruscant, which is 143.905 BPM. However, if you look at Obi-Wan and Ahsoka Argue, the tempogram is all over the place. The tempogram starts at 80 BPM, while according to Spotify, the tempo of the song is 47.996 BPM. It isn’t then surprising that the tempo confidence of this song was 0.01, while the tempo confidence of Cafe Coruscant was 0.464.

Important note: I tried the tempograms with multiple songs to make sure that I didn’t choose the wrong songs to compare, but it turned out that all of them were horrible. Therefore I stayed with my initial choice to choose Obi-Wan and Ahsoka Argue.

Williams’ music lasts the longest


Another result is the difference in song duration. Here you see the distribution of the song duration per composer. As you can see, Williams overall duration of songs is longer and has a few outliers as well. In contrast, Kiner and Goransson take less time per song. Williams wrote music for the Star Wars movies, so logically his songs are longer. The duration of Williams’ songs are between 1:44 and 13:06. As compared to Kiner, Goransson varies more in duration. The duration of Kiner’s songs are between 1:13 and 3:40, while Goransson’s songs last between 1:08 and 6:06. All of Kiner’s songs fall within the interval of Goransson’s songs, which makes it hard to distinguish these two composers by only duration. From this graph we can conclude that if you listen to a Star Wars song and it takes more than four minutes, it is likely to be a song of Williams, and impossible to be a song of Kiner. If you listen to a song that takes only one and a half minutes, you know for sure that it can’t be a song of Williams.

Classification

Not the Magic Tree, but the Random Forest


In this graph you find the feature importance as computed by the Random Forest Classifier. The accuracy is around 0.91, which is very good actually. The results of the classifier are in line with my own results. It doesn’t surprise me at all that acousticness is an important feature to classify the composers, since Williams uses a lot of acousticness as compared to the other two. The duration of Williams’ music is also longer in general as compared to the other two composers.The speechiness surprised me at first because I didn’t notice any speech when I listened to the corpus, but after I analyzed the corpus it became clear that Spotify incorrectly gave a certain speechiness score to Goransson. This gives the classifier some information on how to classify Goransson. Last ofcourse the popularity. As seen in this portfolio, Williams popularity is considerably less as compared to Kiner and Goransson. Therefore, it is possible to classify Williams’ music by only considering the popularity of the tracks. In general I think we can say that the Random Forest Classifier did a great job in finding the important features to distinguish all three composers.

Evaluation of our Randomforest Classifier


Here you find the confusion matrices of the randomforest classifier. As you can see, it was easiest for the classifier to distinguish Goransson’s music. This surprises me since I feel like Williams is standing out way more than the other two so you would think that he is the easiest to classify. The accuracy of the randomforest classifier with all its features was good at first, around 0.73. After I selected the, in my opinion, important features the accuracy went way up. These important features are the one you saw on the randomforest tab. I mostly deselected ID’s, names, images and all those other features that measured when I added the song to the playlist. I actually don’t think it is possible to improve these results because the data of Goransson and Kiner are very similar, which makes it harder to classify them. If I listen to their music I can partly distinguish Goransson’s music because I watched The Mandalorian, but that is around it. There is a lot more ‘war music’ going on in Kiner’s tracks, but Spotify didn’t capture this in its features. Overall I think the randomforest classifier did a great job.

Contribution

The Finale

The goal of this analysis was to look into the differences between the composers of Star Wars music and to see if it is possible to distinguish their music from each other. This is definitely possible, mostly based on track-level features. As we have seen in the randomforest classifier, based on namely the Acousticness, Duration, Speechiness and Popularity, it is possible to identify the composer.

Williams stands out in this analysis, he uses a lot of acousticness in his compositions, and is the least popular composer according to spotify. His pitch and timbre cepstrograms are usually a mess, as compared to Kiner and Goransson. Williams varies a lot in key, and his songs usually last the longest. Lastly, most of the tempo he uses in his compositions are below 100 BPM.

Kiner constantly falls in between the two composers. Based on the track-level features there is little to tell about Kiner, but when looking at the music structure, most of Kiner’s songs are easily divided into parts. As compared to the other two composers, Kiner uses the key C the most. Kiner also varies more in tempo usage, having composed the slowest song and the fastest. Kiner is also the only composer who has a song that doesn’t sound like a Star Wars song and where the tempo can actually be identified using a tempogram.

Goransson stands out on some features. Goransson has the highest popularity score. If the popularity is 30 or above, there is a 67% chance that it is a song composed by Goransson. He also stands out in speechiness. If the speechiness is 0.5 or above, there is a 76% chance it is a song composed by Goransson. The music structure of Goransson overall is not very good, but also not very bad. He falls in between Williams (bad structure) and Kiner (good structure). Regarding key, Goransson tends to go towards F and G. Goransson’s music is very average if you look at tempo and duration. The mean tempo is the same as the mean tempo of Kiner, and duration falls between Williams and Kiner.

In our classifier tab we saw that Goransson was identified 92% correctly, Kiner 91% and Williams 90%%. These are all very good estimations. These conclusions are mostly interesting for fans of the Star Wars franchise. I don’t think there is a certain social relevance, but it is fun to see the differences between the composers.

Fig 2: Star Wars Rebels

Fig 2: Star Wars Rebels

For further exploration one could analyse the different Star Wars series by also adding the Star Wars Rebels series composed by Kevin Kiner. This would allow for adding all the Star Wars movies and both seasons of The Mandalorian.